238 research outputs found
Learning Social Navigation from Demonstrations with Conditional Neural Processes
Sociability is essential for modern robots to increase their acceptability in
human environments. Traditional techniques use manually engineered utility
functions inspired by observing pedestrian behaviors to achieve social
navigation. However, social aspects of navigation are diverse, changing across
different types of environments, societies, and population densities, making it
unrealistic to use hand-crafted techniques in each domain. This paper presents
a data-driven navigation architecture that uses state-of-the-art neural
architectures, namely Conditional Neural Processes, to learn global and local
controllers of the mobile robot from observations. Additionally, we leverage a
state-of-the-art, deep prediction mechanism to detect situations not similar to
the trained ones, where reactive controllers step in to ensure safe navigation.
Our results demonstrate that the proposed framework can successfully carry out
navigation tasks regarding social norms in the data. Further, we showed that
our system produces fewer personal-zone violations, causing less discomfort
Multi-Object Graph Affordance Network: Enabling Goal-Oriented Planning through Compound Object Affordances
Learning object affordances is an effective tool in the field of robot
learning. While the data-driven models delve into the exploration of
affordances of single or paired objects, there is a notable gap in the
investigation of affordances of compound objects that are composed of an
arbitrary number of objects with complex shapes. In this study, we propose
Multi-Object Graph Affordance Network (MOGAN) that models compound object
affordances and predicts the effect of placing new objects on top of the
existing compound. Given different tasks, such as building towers of specific
heights or properties, we used a search based planning to find the sequence of
stack actions with the objects of suitable affordances. We showed that our
system was able to correctly model the affordances of very complex compound
objects that include stacked spheres and cups, poles, and rings that enclose
the poles. We demonstrated the applicability of our system in both simulated
and real-world environments, comparing our systems with a baseline model to
highlight its advantages
Iterative methodology on locating a cement plant
In this study, a cement plant location was determined by considering essential parameters such as the locations of resources and their importance in the manufacturing process. A crucial mathematical problem, named Weber problem, reinforced the decision of the method of allocating the factory. Additionally, not only the limitations of the cement production but also the importance weights of goods used in the manufacturing were taken into account in the iterative methodology in order to answer the engineering question via the mathematical problem. As a result, by optimizing the case through the iterations introduced in the paper, the location of the cement plant was set. Hence several losses such as extra travel distances and time wasting in transportation were minimized.No sponso
Learning Multi-Object Symbols for Manipulation with Attentive Deep Effect Predictors
In this paper, we propose a concept learning architecture that enables a
robot to build symbols through self-exploration by interacting with a varying
number of objects. Our aim is to allow a robot to learn concepts without
constraints, such as a fixed number of interacted objects or pre-defined
symbolic structures. As such, the sought architecture should be able to build
symbols for objects such as single objects that can be grasped, object stacks
that cannot be grasped together, or other composite dynamic structures. Towards
this end, we propose a novel architecture, a self-attentive predictive
encoder-decoder network with binary activation layers. We show the validity of
the proposed network through a robotic manipulation setup involving a varying
number of rigid objects. The continuous sensorimotor experience of the robot is
used by the proposed network to form effect predictors and symbolic structures
that describe the interaction of the robot in a discrete way. We showed that
the robot acquired reasoning capabilities to encode interaction dynamics of a
varying number of objects in different configurations using the discovered
symbols. For example, the robot could reason that (possible multiple numbers
of) objects on top of another object would move together if the object below is
moved by the robot. We also showed that the discovered symbols can be used for
planning to reach goals by training a higher-level neural network that makes
pure symbolic reasoning.Comment: 7 pages, 7 figure
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
Offline Reinforcement Learning (RL) methods leverage previous experiences to
learn better policies than the behavior policy used for data collection. In
contrast to behavior cloning, which assumes the data is collected from expert
demonstrations, offline RL can work with non-expert data and multimodal
behavior policies. However, offline RL algorithms face challenges in handling
distribution shifts and effectively representing policies due to the lack of
online interaction during training. Prior work on offline RL uses conditional
diffusion models to represent multimodal behavior in the dataset. Nevertheless,
these methods are not tailored toward alleviating the out-of-distribution state
generalization. We introduce a novel method named State Reconstruction for
Diffusion Policies (SRDP), incorporating state reconstruction feature learning
in the recent class of diffusion policies to address the out-of-distribution
generalization problem. State reconstruction loss promotes generalizable
representation learning of states to alleviate the distribution shift incurred
by the out-of-distribution (OOD) states. We design a novel 2D Multimodal
Contextual Bandit environment to illustrate the OOD generalization and faster
convergence of SRDP compared to prior algorithms. In addition, we assess the
performance of our model on D4RL continuous control benchmarks, namely the
navigation of an 8-DoF ant and forward locomotion of half-cheetah, hopper, and
walker2d, achieving state-of-the-art results.Comment: 8 pages, 7 figure
High-level Features for Resource Economy and Fast Learning in Skill Transfer
Abstraction is an important aspect of intelligence which enables agents to
construct robust representations for effective decision making. In the last
decade, deep networks are proven to be effective due to their ability to form
increasingly complex abstractions. However, these abstractions are distributed
over many neurons, making the re-use of a learned skill costly. Previous work
either enforced formation of abstractions creating a designer bias, or used a
large number of neural units without investigating how to obtain high-level
features that may more effectively capture the source task. For avoiding
designer bias and unsparing resource use, we propose to exploit neural response
dynamics to form compact representations to use in skill transfer. For this, we
consider two competing methods based on (1) maximum information compression
principle and (2) the notion that abstract events tend to generate slowly
changing signals, and apply them to the neural signals generated during task
execution. To be concrete, in our simulation experiments, we either apply
principal component analysis (PCA) or slow feature analysis (SFA) on the
signals collected from the last hidden layer of a deep network while it
performs a source task, and use these features for skill transfer in a new
target task. We compare the generalization performance of these alternatives
with the baselines of skill transfer with full layer output and no-transfer
settings. Our results show that SFA units are the most successful for skill
transfer. SFA as well as PCA, incur less resources compared to usual skill
transfer, whereby many units formed show a localized response reflecting
end-effector-obstacle-goal relations. Finally, SFA units with lowest
eigenvalues resembles symbolic representations that highly correlate with
high-level features such as joint angles which might be thought of precursors
for fully symbolic systems
- …